SEQUENCE LISTING



<110> SZOSTAK, JACK W.
WILSON, DAVID S. 
KEEFE, ANTHONY D. 

<120> [...] PEPTIDES AND USES THEREOF

<130> 00786/388002

<140> US 10/004,381
<141> 2001-10-31 

<150> US 60/244,541 
<151> 2000-10-31 

<160> 41 

<170> FastSEQ for Windows Version 4.0

<210> 1 
<211> 101 

<213> Artificial Sequence

<223> selected peptide

Met MP Olu We Thr His cy J0 ^ hu 

pro Olu Leu H.S Al. Glu „ ^ m1 Trp 



Arg Leu Pro Pro Ser
100



<210> 2 
<211> 101

<213> Artificial Sequence

<223> selected peptide

Thr His cys Phe His Pro Gly Asp His Leu Val ^ 
Met Asp Glu Lys Thr His cy 1Q 
1 






Leu Val Glu Glu Leu Gln Ala Leu Ala Glu Gly Leu Gln Arg Gln Gly
20 25 30

Gly Arg Gln Pro His Arg Leu Pro Arg Arg Arg Pro His His Leu Gln
35 40 45

Leu Leu Leu Asp Glu Ala His Pro Gln Ala Gly Pro Leu Arg Glu Arg
50 55 60

Ala His Gln Val Asp Gly Arg Leu Leu Leu Gln His His Pro Gln Gly
65 70 75 80

Asp Arg Leu Leu Gln Gln Pro Gln Asp His Pro Leu Glu Leu Val Trp
85 90 95

Arg Leu Pro Pro Ser
100













<210> 3 

<211> 101 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 

<400> 3 



Met Thr Arg Arg Pro Thr Ala Ser Ser Ser Ser Cys Val Arg His Leu
1 5 10 15

Leu Leu Arg Gln Gly Glu His Gly His Gln Ala Leu Glu Asp Arg Asp
20 25 30

Lys Ala Arg His Val Arg Leu Val Glu Gly Asp Val Glu Val Leu Gly
35 40 45

Gly Leu Asp Arg Leu Ala Arg Ala Arg His Glu Ala Leu His Pro Gln
50 55 60

Ala Gly Leu Val His Leu Pro Leu His Gly Gly Asp Leu Gly Gly His
65 70 75 80

Leu Arg Leu Val Leu Glu Ala His Pro Gln Gly Asp Arg Leu Gly Leu
85 90 95

Ala Val His His His
100













<210> 4 
<211> 102 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 
<400> 4 

Met Asp Glu Lys Thr His Trp Gly Ile Ser Thr Trp Arg Gly Glu Pro
1 5 10 15

Leu Leu His His Pro Gln Ala Gly Arg Leu Pro Leu Asp Arg Arg Arg
20 25 30

Ala Arg His Arg Arg Ile Leu Gly Ala Glu Pro Gly Gly Val Asp His
35 40 45

Gly Leu Arg Leu Glu Leu Leu Asp Asp His Arg Pro Leu Val Pro Asp
50 55 60

His His Pro Gln Arg Gly Pro Leu Gln Arg Gly Asp Leu Pro Gln Val
65 70 75 80






L eu Arg His Ma His Val Leu Gly - Gly Leu 
val Pro Leu Val Arg Leu 
Ala Ma Ma Thr He Thr 



<210> 5 
<211> 102 



<213> Artificial Sequence
<223> selected peptide



«»i Tvr His Pro Gin Gly Asp 
<400> 5 „ is Trp Val Asn Val Tyr n 15 

Met Asp Glu Lys Thr His „ Leu His As p Gin 

Gly Leu His ^in Leu ~> - JJ> ^ _ ^ «, Leu Val P ro 



Gin Hi- — - 85 



<210> 6 
<211> 102 

<213> Artificial Sequence

<220>
<223> selected peptide

phe Glu Glu Leu Leu Ala 
<400> 6 „ is Trp L eu Asn Asn Pne i» 15 

Met Asp Glu Lys Thr His asd His Pro Leu Val Leu Arg 

1 „ t u Arg Glu Gly Glu Asp His 3Q 

Arg Leu Asp Gly Leu Arg 25 ftrg His 

20 , r iv Leu Leu Asp Gin w« 

His His Pro Gin Gly Asp Gly Leu 45 ^ 

w s, «. «. « - Giy sl ; : » «. «, - - - 

, „. e Glv Val H1 95 
Leu Asp Gly ^u Val His Val 

C ys Gly Cys His His His 



<210> 7 
<211> 101 






<213> Artificial Sequence
<223> selected peptide 



<4°°> 7 j Thr His Tr P Phe Gly Thr Leu Asn Ser Phe Pro Thr 
Met Asp Glu Lys Thr His 10 phe Asn 

Met Ser Ala Val Gly Asn Gly Lys He Asp Cys Ser 

a tpu Ser Leu Asn His Trp Leu 45 
Me t Asn Leu 4 o ^ Leu val Gly Mg 

„. Leu «P asp Oin Leu His ^ ^ 

50 n „=i rln Ala Leu Arg Leu Glu biy ^ 8Q 

„p ASP Gly »1 «1 Oin 

Ar 9 Leu AU Bin AT9 Arg » £ 9 



A rq Leu Pro Pro Ser 
y 100 



<210> 8 
<211> 102

<213> Artificial Sequence

<220>

<223> selected peptide 

« 00 > 8 s Thr His Cys Thr He Glu Leu Asn Phe Ser Phe Thr 

Met Asp Glu Lys Thr His y 

rvs Leu His His His Pro Gin Gly Asp Ala Leu L 
His Trp L ys Leu H ^ ^ ^ _ Asp 

Gly val A ? Pro His £ _ Ma Arg 

Gln Gly Leu Gly His Ar 9 Ar 9 Gly ^ 

50 , tr i t^i Glu Gly Leu Val Glu Ary 8Q 

Mg asp Pro Glu V.1 ^eu „ ^ ^ ^ 

Si X, Leu Arg His Sly By <*> « 



85 

Trp Arg Leu Pro Pro Ser 
100 



<210> 9 
<211> 102

<213> Artificial Sequence

<223> selected peptide

«°°> » T Thr His Cys Asn Thr Gly Leu Tyr Asp Gly Ala Ala 
Met Asp Glu Lys Thr His cy 






10 15 
^p cys Phe »n Glu Leu Asn Lys Asp val A!. Pro Leu -1 01» Gly 

Arg „ is Asp Leu Val Glu Gly Leu Leu Leu Glu Ar 9 His Pro Gin 0!, 

r 5 v„i Ala His Arg Gin Leu Val His His Pro Leu Leu Gly 
Asp Pro Leu Val Ala his aiy « gQ 

Arg lly Glu Ar g His « g Ar g Ala «u Vsl Pro Gin Gin Gin His Gin 
£. His Ar g Leu Gin Pro val val asp Leu Gly Ar g Ar g «, Leu val 

Trp Arg Leu Pro Pro Ser 
100 



<210> 10 
<211> 103 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 



<400> 10 x Arg Ala Gin Glu Leu Val Gly 

Met Asp Glu Lys Thr His Trp His biu nry ig 

G i y L eu Leu Leu hL Asp His Pro Gin Arg Leu Leu Leu Glu Pro Arg 
Gly Pro Arg fro Leu Arg Gi y Leu Val His Glu Arg Gly His Gin Pro 

r 5 a i = civ Arq Val Glu Glu Ala Asp Gly Gly Leu Leu Arg 
Gin Pro Leu Ala Gly Arg vax gQ 

ftsp Gly Gly Gly Glu Leu Glu Pro Leu Val «g Glu Giy Gin Asp His 
6 Len Glu Pro Len Asp Asp Glu Leu Asp Ala Gly Pro Ar g Gly Leu Val 



85 

Trp Arg Leu Pro His His His 
100 



<210> 11 
<211> 102 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 



; 4 e?°Asp l Glu Lys Thr His rrp His Glu Ar g Val His His Leu Ala Asp 
a \ y Leu Gin Gin As Pro Gin Gly Gin Are Ar g Pro Leu Val Glu Ar g 
„ is „ Gin val Pro Ar 9 Gly Len Val Ar g Glu Leu Gin His Gin Giy 
Lsu Pro 2. Glu His Pro Ala Gly Val Hi. val II. Ar g Len His Gin 
Gly Asp Asp Arg Asp Val Asp Gly Leu val Asp Gly His Gly Ar g Asp 






75 His Arq Leu Val 

70 t ASP Gly Pro HlS 9 q 5 

6 , 5 S1 « 9 «y - T " 9 ° lu V " ' » 



<210> 12
<211> 101

<213> Artificial Sequence
<223> selected peptide

Glu Glu Leu Arg Olu Arg 
<400> 12 pro Leu Le u Glu Glu Leu Glu 

M t Asp L Y s Asp Pro ^ ^ ^ Arg f oln 

i Val His His Pro Gin . ^ ^ ^ _ Arg 

Gly His ASP Ala Glu Arg J ^ His His val 

GlY Leu feu Asp Glu Pro Gin Arg > ^ ^ ^ „ 

pro S. « val Gly Gin Arg ^ - „ - Gln Arg Leu Val T rp 
Su Gly Leu Leu Asp Gin Arg Asp gQ 



« LeU Pr ° fo°o 



<210> 13 
<211> 102

<213> Artificial Sequence

- SeleCtedP6Ptide , Le u Arg Glu Leu Arg Glu 

^»^ZZZtZ^^ 

M p 8l y 31, d» 

~ «, 2. - - »" ST W o ^ Gly S. His Olu Oly « 
GW », SI, - » - 7 X Hi3 Gin «, « « ^ 

o? y «. «*r - T ™ V " K ° y - 

Trp «g Pro Pro ser 



<210> 14 




<211> 101 

<213> Artificial Sequence

<223> selected peptide

T Q11 q P r phe Asn 
Thr His A rg Thr Leu Ser Val Ser Leu Ser 
Met Asp Glu Lys Thr Hxs Ar9 10 ^ Qlu oly Leu 

1 pt„ rin Thr Lys Ala Cys Trp Arg 3Q 

» W I? , val », G1 u His 01» V,! »P V! «» 

Hi3 aly His Pro ol„ civ v f ^ Mp Gly 

Pro Le u iL slu «• - cln o» v. a £ 

50 _ nl ^ r -Leu Leu His Arg ha<* 80 

v ?1 «„ - « - « ^ , L oiy „ Leu f „ 

His Pro Leu Pro Asp Hxs Ala Gly gQ 



Arg Leu Pro Pro Ser 
Ary 100 



<210> 15 
<211> 101 

<213> Artificial Sequence

<220>
<223> selected peptide

t ,o nv Val Leu Lys 
<400> 15 T . His Trp L eu Glu Asp Leu Lys e y i5 

Met Asp Glu Lys Thr ^ Ser pro 

1 t pu Met Asp P^e inr 3Q 

Mp cys Leu Lys MP 

t rin Pro Leu Leu His nx jr 
K9 val Gin Pr° ° ln 40 x Leu Gly Pro «9 

Pro WU 2. «. - »• Rrg * sp L ; u s j y «, Hi . MP «. ro s 

So « ----- f/ G1 pro Le „ v,! °cye 

Glu Pro Leu v »l Gln ASP HiS Pt ° 90 



85 

Gly cys His His His 



<210> 16 
<211> 102 

<212> PRT
<213> Artificial Sequence
<220>
<223> selected peptide
<400> 16 








wX U 




Th y 

J. 1 J.X 


Hi q 


Trp Val 


i/cu uin ucu ZiJL 0 


ny O-LH uxy nsy 








5 






10 


15 


riXy uCU 


vjx y 


Pyn 






Glv Glv 

vj j. y \j jl y 


&qn Sen 1 Am 


Lpn v^i niv mn 






20 








25 


30 


n~\ \r ril 11 

ijiy L7iu 


vjx y 


val 


.Leu 


vjX u 


oiy ucu 


>iD|j vjxy niy rxu 


7\ ye* Ay<~r y rr Syn 
/ix y Hiy rti y i-ixy 




35 








40 




45 


His Arg Leu Pro Arg Glu Asp Glu His Arg Val Arg Ala Leu Val Asp
50 55 60

Gln Val Arg Asp Leu Ala Glu Arg Leu Val Glu Glu Val Asp Gly Gly
65 70 75 80

Val Glu Ala Leu Arg His Leu Gly Leu Pro Gln Asp Glu Pro Arg Ser
85 90 95

Gly Gly Cys His His His
100













<210> 17 

<211> 102 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 

<400> 17 



Met Asp Glu Lys Thr His Trp Val Gly Asp Leu Gln Glu Pro Leu Gly
1 5 10 15

Pro Leu His Gly Gly Val Gly Glu Val Pro Gly Gly Leu Val Leu Arg
20 25 30

His His Pro Gln Arg Asp Arg Leu Val Asp Gly Val Gly Pro His Gly
35 40 45

Arg Ala Leu Ala Arg Arg Pro His Arg Val Val Glu Gly Leu His His
50 55 60

Leu Leu Gln Arg Gly Gly Glu Arg Leu Pro Pro Asp Gly Pro Arg Gln
65 70 75 80

Leu Gly Leu Leu Gly Gly Glu Leu Asp Arg Ala Asp Pro Ala Leu Val
85 90 95

Trp Arg Leu Pro Pro Ser
100













<210> 18 

<211> 101 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> selected peptide 

<400> 18 



Met Asp Glu Lys Thr His Cys Ala Val Asn Val Asn Val Gly Leu Thr
1 5 10 15

His Trp Cys His Arg Val Ala His Leu Gln Pro Leu Asp Pro His Pro
20 25 30

Gln Gly Asp His Leu Arg Leu Glu Pro Leu Gly His Ala Leu Val Asp
35 40 45

Pro Leu Val Gln Gly Val Glu Glu Val Val Arg Pro Leu Gln Leu Asp
50 55 60










val m »X «- «■ - - «. va, «. f n V. «- - g 
vai u y 70 , „. rlv Ma Leu Val Trp 

65 „ <c r-lu Ala Gly Gin Ala His Gly Aia u 

Glu Gly Leu Asp His Glu Aia y gQ 9b 



85 

Arg Leu Pro Pro Ser 



<210> 19 
<211> 101 

<213> Artificial Sequence 



<223> selected peptide 

< 400> 19 n t s Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
Met Asp Glu Lys Thr Thr u y 1Q . His Pro 

1 „ fliy Glu Leu Glu Gin Leu Arg Ala Arg Leu Glu His 
L eu Ala Gly Glu ^ ^ Qlu 

G - ^ " ^ ZZlZ val Ala Gly 2. ~ As P Pro 
Gly ^ Val Gin Asp « ^ ^ ^ _ ^ 

val Glu Lys Leu Leu Thr Asp 

65 |. i. Ihr Phe Tyr Leu Glu Met Tyr Asp ir P 

Lys Asp Cys Lys Met Thr Pne y gQ 
8 5 

Gly Cys His His His 
100 

<210> 20 
<211> 102 

<213> Artificial Sequence 
<223> selected peptide 

< 400> 20 T Thr His Cys Lys Leu Asn Phe Lys Val Asn lie Ala 
T ^ T J Gly Qly » y Gln Gly Leu Leu Gly Arg 

rr tp„ Ala Glu Phe His Gly Gly <^Y 3Q 
Asp Trp Leu Ala ^ ^ ^ Val 

*=o rlv Val Val Gin Arg Leu Val Asp y 
Arg Asp Gly vai 4Q Leu 

85 

Trp Arg Leu Pro Pro Ser 






<210> 21 
<211> 101

<213> Artificial Sequence

<220>
<223> selected peptide

<«o> - t Thr T hr Oly Arg Oly oly His Val val Olu oiy 
Met Asp Glu Lys Thr 0 His His Pro 

Gly v. «. -p - ^ « ws m „ yal 

v.l Glu Lys Leu Leu Thr W « ser Qly 

5. »p cy= Lys « « - ^ « fo" 



Gly Cys Lys Leu Gly 
y 100 



<210> 22 
<211> 89

<212> PRT
<213> Artificial Sequence

<223> selected peptide

«°0> 22 t Thr Thr Gly Trp Arg Gly Gly His val Val Glu Gly 
Met Asp Glu Lys Thr 0 His His Pro 

1 , m„ Leu Glu Gin Leu Arg Ala Arg 

"» " 3 £" , val |t„ Olu u.l ulu »P « » G1 " 

Gln 01y oi» » ™> "» «, Leu Mp Pr o 

3 5 Glv val Val Ala Gly 

Gly wu V,! Gin MP L- H s Sly „ v ^ 

50 Asr) Trp Phe Lys Lys Pne uy gQ 

Val Glu Lys Leu Leu Thr Asp P ?5 

M 5 et Met ser Gly Gly Cys Lys Leu Gly 



<210> 23 

<213> Artificial Sequence

<223> selected peptide

»»lu Lys . ~ oiy » « - oly His us! us! olu oly 






5 l?a Arg Leu Olu His hL Pro 

l r nu Gin Leu Arg Ala Arg 

L X eu A la Gly Olu Leu Olu 

aly Leu V.1 oln »P Leu ? 5 S t " 

J - u „ et « « -V «V <*• - - 

65 



<210> 24 
<211> 61

<213> Artificial Sequence

^ - ? ™ Gly w " 9 E I ~ oxu *. S. - 

1 T riu Gin Leu Arg Ala Arg 3Q 

Gin Gly cm Arg Glu 4Q 

Gly « v 5 ,, «. « - tr Gly Gly * 60 



<210> 25 
<211> 47 



<213> Artificial Sequence
<220>
<223> selected peptide

Gln Gly Gin Arg Glu vro ^ 



<210> 26 
<211> 47 

<213> Artificial Sequence

<220>
<223> selected peptide






Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Gly
20 25 30

Ala Gly Gln Arg Glu Pro Met Met Ser Gly Gly Cys Lys Leu Gly
35 40 45



<210> 27 

<211> 39 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> selected peptide 

^°Lp 7 <ny His val v.i giu Gly Leu Ala Gly «. - «» °f "» 

i Ala Arg Leu Gru His His » Gin Gly Oin Arg OIU pro Met Met 

20 ^ 
Ser Gly Gly Cys Lys Leu Gly 
35 



<210> 28 

<211> 89 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> selected peptide 



ST «Y «~ »° Gly Glu Leu Glu Giu Leu Arg «• « ™ 

G L His His Pro gL Gly Gin Arg Glu Pro Leu Vai Gin Glu v,i Glu 

20 , Ten His Glv Val Val Ala Gly 

Asp Val Asp Glu Gly Leu Val Gin Asp Leu His Gly ^ 

35 , To „ Thr Asr> Trp Phe Lys Lys Phe 

Le u Leu Asp Pro Val Glu Lys Leu Leu Thr Asp Trp 

50 T T ur Phe Tvr Leu Glu Met Tyr 

Lys Asn Val Ser Lys Asp Cys Lys Met Thr Phe Tyr &q 

70 

Is? Trp Ser Gly Gly Cys Lys Leu Gly 



<210> 29 

<211> 29 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> selected peptide 



Sr t 0 Gl„\eu Glu Gin Leu Arg Ala Aig Leu Glu His His Pro Gin Gly 
Gln Arg Glu Pro Met Met Ser Gly Gly Cys Lys Leu Gly






20 



25 



<210> 30 
<211> 35

<213> Artificial Sequence
<220> 

<223> primer 

<210> 31 
<211> 35 

<213> Artificial Sequence
<220> 

<223> primer 

< 400> 31 acuacaag cuuggcaccg gcuau 
acuggucugg cggcugcaag 

<210> 32 
<211> 14

<213> Artificial Sequence

<223> selected peptide

-Se^OlV T Hie Hi. His Hi, Hie Hie Ser Ser 



<210> 33 
<211> 11

<213> Artificial Sequence

<223> designed peptide

CsefdV 01V CVS «■ - 01V - ^ 



<210> 34 
<211> 44

<212> PRT
<213> Artificial Sequence
<223> designed peptide




- «■ •» gi - Ma oiy s 8 val RBP Gln Ala " p T 

X - V al G1 „ Gin 0 ly * - £. L. *. ~ - f 0 V - « 

„ et „et s„ Sy «y cy S W - - m Tyr 

35 



<210> 35 

<211> 38 

<212> PRT 

<213> Artificial Sequence

<220>

<223> selected peptide 



r U*«u * ~ «, «, fo v -y - v* v. - 

Leu U , «, «. - «» «- - - - - - J' **" 



20 

Gln Gly Gln Arg Glu Pro
35 



<210> 36 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> designed peptide 



Met Met Ser Gly Gly Cys Lys Leu Gly



1 



<210> 37 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid 



is sss Hi ill 111 pi 

tgcatttcac acaagggtcc ^gctacctc 9999 ctccag tcgt 9«cgaagtt 420 

3£SS SEES SSSi S9 "° 9cacs 9=atac 






aagcggacat ctgcgggatg ttcggcatga tttcaccttt ctgggcgttt tccattgtgg 660 
cggcaatacg tggatctttc gccaactctt cctcgtaaga cttcagcgct acggcaccca 720 
gcggtttgtc tttattaacc gcttccagac cttcatcagt cagcagatag ttttcgagga 780 
actcttttgc cagctctttg ttcggactgg cggcgttaat acctgcgctc agcacgccaa 840 
cgaacggttt ggatggttga cccttgaagg tcggcagtac cgttacacca taattcactt 900 
tgctggtgtc gatgttggac catgcccacg ggccgttgat ggtcatcgct gtttcgcctt 960 
tattaaaggc agcttctgcg atggagtaat cggtgtctgc attcatgtgt ttgtttttaa 1020 
tcaggtcaac caggaaggtc agacccgctt tcgcgccagc gttatccacg cccacgtctt 1080 
taatgtcgta cttgccgttt tcatacttga acgcataacc cccgtcagca gcaatcagcg 1140 
gccaggtgaa gtacggttct tgcaggttga acatcagcgc gctcttacct ttcgctttca 1200 
gttctttatc cagcgccggg atctcttccc aggtttttgg cgggttcggc agcagatctt 1260 
tgttataaat cagcgataac gcttcaacag cgatcgggta agcaatcagc ttgccgttgt 1320 
aacgtacggc atcccaggta aacggataca gcttgtcctg gaacgctttg tccggggtga 1380 
tttcagccaa caggccagat tgagcgtagc caccaaagcg gtcgtgtgcc cagaagataa 1440 
tgtcagggcc atcgccagtt gccgcaacct gtgggaattt ctcttccagt ttatccggat 1500 
gctcaacggt gactttaatt ccggtatctt tctcgaattt cttaccgact tcagcgagac 1560 
cgttatagcc tttatcgccg ttaatccaga ttaccagttt accttcttcg attcccatgg 1620 
tatatctcct tcttaaagtt aaacaaaatt atttctagag gggaattgtt atccgctcac 1680 
aattccccta tagtgagtcg tattaatttc gcgggatcga gatctcgatc ctctacgccg 1740 
gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc tatatcgccg 1800 
acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct tgtttcggcg 1860 
tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc ttgcatgcac 1920 
cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc ttcctaatgc 1980 
aggagtcgca taagggagag cgtcgagatc ccggacacca tcgaatggcg caaaaccttt 2040
cgcggtatgg catgatagcg cccggaagag agtcaattca gggtggtgaa tgtgaaacca 2100 
gtaacgttat acgatgtcgc agagtatgcc ggtgtctctt atcagaccgt ttcccgcgtg 2160 
gtgaaccagg ccagccacgt ttctgcgaaa acgcgggaaa aagtggaagc ggcgatggcg 2220 
gagctgaatt acattcccaa ccgcgtggca caacaactgg cgggcaaaca gtcgttgctg 2280 
attggcgttg ccacctccag tctggccctg cacgcgccgt cgcaaattgt cgcggcgatt 2340 
aaatctcgcg ccgatcaact gggtgccagc gtggtggtgt cgatggtaga acgaagcggc 2400 
gtcgaagcct gtaaagcggc ggtgcacaat cttctcgcgc aacgcgtcag tgggctgatc 2460 
attaactatc cgctggatga ccaggatgcc attgctgtgg aagctgcctg cactaatgtt 2520 
ccggcgttat ttcttgatgt ctctgaccag acacccatca acagtattat tttctcccat 2580 
gaagacggta cgcgactggg cgtggagcat ctggtcgcat tgggtcacca gcaaatcgcg 2640
ctgttagcgg gcccattaag ttctgtctcg gcgcgtctgc gtctggctgg ctggcataaa 2700 
tatctcactc gcaatcaaat tcagccgata gcggaacggg aaggcgactg gagtgccatg 2760 
tccggttttc aacaaaccat gcaaatgctg aatgagggca tcgttcccac tgcgatgctg 2820 
gttgccaacg atcagatggc gctgggcgca atgcgcgcca ttaccgagtc cgggctgcgc 2880 
gttggtgcgg atatctcggt agtgggatac gacgataccg aagacagctc atgttatatc 2940
ccgccgttaa ccaccatcaa acaggatttt cgcctgctgg ggcaaaccag cgtggaccgc 3000 
ttgctgcaac tctctcaggg ccaggcggtg aagggcaatc agctgttgcc cgtctcactg 3060 
gtgaaaagaa aaaccaccct ggcgcccaat acgcaaaccg cctctccccg cgcgttggcc 3120 
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 3180
cgcaattaat gtaagttagc tcactcatta ggcaccggga tctcgaccga tgcccttgag 3240 
agccttcaac ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact 3300 
tatgactgtc ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat 3360 
tttcggcgag gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt 3420 
cggaatcttg cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg 3480 
cgagaagcag gccattatcg ccggcatggc ggccccacgg gtgcgcatga tcgtgctcct 3540 
gtcgttgagg acccggctag gctggcgggg ttgccttact ggttagcaga atgaatcacc 3600 
gatacgcgag cgaacgtgaa gcgactgctg ctgcaaaacg tctgcgacct gagcaacaac 3660
atgaatggtc ttcggtttcc gtgtttcgta aagtctggaa acgcggaagt cagcgccctg 3720 
caccattatg ttccggatct gcatcgcagg atgctgctgg ctaccctgtg gaacacctac 3780 
atctgtatta acgaagcgct ggcattgacc ctgagtgatt tttctctggt cccgccgcat 3840 
ccataccgcc agttgtttac cctcacaacg ttccagtaac cgggcatgtt catcatcagt 3900 
aacccgtatc gtgagcatcc tctctcgttt catcggtatc attaccccca tgaacagaaa 3960 
tcccccttac acggaggcat cagtgaccaa acaggaaaaa accgccctta acatggcccg 4020
ctttatcaga agccagacat taacgcttct ggagaaactc aacgagctgg acgcggatga 4080 
acaggcagac atctgtgaat cgcttcacga ccacgctgat gagctttacc gcagctgcct 4140
cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 4200
agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 4260
tggcgggtgt cggggcgcag ccatgaccca gtcacgtagc gatagcggag tgtatactgg 4320
cttaactatg cggcatcaga gcagattgta ctgagagtgc accatatatg cggtgtgaaa 4380
taccgcacag atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct tcctcgctca 4440
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 4500
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 4560
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 4620
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 4680
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 4740
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 4800
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 4860
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 4920
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 4980
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 5040
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 5100
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 5160
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 5220
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaac aataaaactg 5280
tctgcttaca taaacagtaa tacaaggggt gttatgagcc atattcaacg ggaaacgtct 5340
tgctctaggc cgcgattaaa ttccaacatg gatgctgatt tatatgggta taaatgggct 5400
cgcgataatg tcgggcaatc aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg 5460
ccagagttgt ttctgaaaca tggcaaaggt agcgttgcca atgatgttac agatgagatg 5520
gtcagactaa actggctgac ggaatttatg cctcttccga ccatcaagca ttttatccgt 5580
actcctgatg atgcatggtt actcaccact gcgatccccg ggaaaacagc attccaggta 5640
ttagaagaat atcctgattc aggtgaaaat attgttgatg cgctggcagt gttcctgcgc 5700
cggttgcatt cgattcctgt ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc 5760
gctcaggcgc aatcacgaat gaataacggt ttggttgatg cgagtgattt tgatgacgag 5820
cgtaatggct ggcctgttga acaagtctgg aaagaaatgc ataaactttt gccattctca 5880
ccggattcag tcgtcactca tggtgatttc tcacttgata accttatttt tgacgagggg 5940
aaattaatag gttgtattga tgttggacga gtcggaatcg cagaccgata ccaggatctt 6000
gccatcctat ggaactgcct cggtgagttt tctccttcat tacagaaacg gctttttcaa 6060
aaatatggta ttgataatcc tgatatgaat aaattgcagt ttcatttgat gctcgatgag 6120
tttttctaag aattaattca tgagcggata catatttgaa tgtatttaga aaaataaaca 6180
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gaaattgtaa acgttaatat 6240
tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc aataggccga 6300
aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga gtgttgttcc 6360
agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag ggcgaaaaac 6420
cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt ttttggggtc 6480
gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta gagcttgacg 6540
gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag cgggcgctag 6600
ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg cgcttaatgc 6660
gccgctacag ggcgcgtccc attcgcca 6688



<210> 38 
<211> 479 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Designed protein 
<400> 38 

Met Gly Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys
1 5 10 15
01y w Asn «, « - «- v., Gly Lys - «■ SJ- - *

r y xl Lys v°,l Thr Va! «u », "o Asp Lys « «« - - 
Gly J-J- e ^ 40 . T , Trp Ala 

pro Gln val Ma Ala Thr f V Asp OXV Pro Asp XI. - P 

50 , Phe Gly Gly Tyr Ala Gin ser Gly Leu Leu Ma Glu lie 
His Asp Arg Phe Gly Gly 

Sr P- Asp Lys Ala Phe Gin Asp Lys Leu Tyr 

85 ^_ Xvr pro He Ala Val Glu 

», V,! «, Tyr » «1V W - | U0 

10 ti. Tyr Mn Lys Asp Leu Leu Pro Asn Pro Pro Ly 
». Leu S.r LSS US Tyr As y g ^ ^ ^ 

Thr Trp Glu Glu He Pro Ala Leu Asp uy 

Z Ala L.u Me, Ph. S Leo G!n - Pro Tyr Ph. Thr Trp Pro 

n. Lys Asp V.1 Gly va! Asp As» Ala Gly Ala Lys Ala Gly 
Tyr asp He Lys Asp Mn ua Mp 

L .u Thr Phe Leu Val Asp Leu U. Lys y 

xhr Asp Ty S.r He Ala Glu Al. Ala Phe Asn Lys Gly 

» S ! A „ Gly Pro SS Ala Trp s.r Asn He Asp Thr ser Lys 
„« Thr ,1. n G wo 2£ ws My 

v.1 A.n Tyr G!y val £ pro 

Lys pr0 Ph e V.1 Gly val Leu Ser A Gly 

2 60 . Ttt „ pi,, Asn Tyr Leu Leu Tnr a&p 

M „ Lys Glu Leu Ala Lys Glu Phe Leu «. » 

01u Gly Leu Glu Ala Va! Asn Lys Asp Lys Pro eu Gly 

H ser Tyr Glu Glu III Leo Al, Lys Asp Pro Ar 9 He Al, Ala 

SS ^ Rla ™ W s GXy Glu Ue Met Pro Asn He Pro G!n 

Thr Met Glu Asn Al, Gin Lys y „, ^ 

- s - ~ » Trp Ty ; :; ; - ^ „ a * - 

seI 01y Ag Gin Thr V,l A, ^ ^ ^ ^ ^ 

^ Ala Arg Leu III His His Pro Gin Gly Gin Arg Glu Pro Gly 

L eu Arg Ala Arg Leu ^ ^ ^ ^ 

Ser Gly His His His His His His Glu 43 

420 Ala pro Arg Asn Trp Lys Val 

MPt- asp Pro Cys Val Lys Cys Lys Val Ala vr a 

Met Asp fro / 440 Asn 

Lys A sn Ly His Leu Arg He Tyr Asn Met Cys Lys Thr Cys 

S Xle Gly Asp Asp Thr Tyr His Gly His Asp Asp 

Asn Ser He Asp He Gly Asp * p ^ 

465 
<210> 39
<211> 47 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Selected peptide 
<400> 39 

Met Asp Pro Cys Val Lys Cys Lys Val Ala Pro Arg Asn Trp Lys Val
1 5 10 15

Lys Asn Lys His Leu Arg Ile Tyr Asn Met Cys Lys Thr Cys Phe Asn
20 25 30

Asn Ser Ile Asp Ile Gly Asp Asp Thr Tyr His Gly His Asp Asp
35 40 45



<210> 40 
<211> 6544 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid 
<400> 40 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60
ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120
tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttca 180
gctgcgaagc ttattagaat tcgtgatgat ggtgatggtg accggatcct ggttcacgtt 240
gaccttgtgg gtgatgctcc agtcgtgcac gaagttgttc aagttcccca gccagtcctt 300
caacaacgtg tccacctcgc caaccagtgg tcttctcgtc catccctgag ccgctacctc 360
ctgagctcga attagtctgc gcgtctttca gggcttcatc gacagtctga cgaccgctgg 420
cggcgttgat caccgcagta cgcacggcat accagaaagc ggacatctgc gggatgttcg 480
gcatgatttc acctttctgg gcgttttcca ttgtggcggc aatacgtgga tctttcgcca 540
actcttcctc gtaagacttc agcgctacgg cacccagcgg tttgtcttta ttaaccgctt 600
ccagaccttc atcagtcagc agatagtttt cgaggaactc ttttgccagc tctttgttcg 660
gactggcggc gttaatacct gcgctcagca cgccaacgaa cggtttggat ggttgaccct 720
tgaaggtcgg cagtaccgtt acaccataat tcactttgct ggtgtcgatg ttggaccatg 780
cccacgggcc gttgatggtc atcgctgttt cgcctttatt aaaggcagct tctgcgatgg 840
agtaatcggt gtctgcattc atgtgtttgt ttttaatcag gtcaaccagg aaggtcagac 900
ccgctttcgc gccagcgtta tccacgccca cgtctttaat gtcgtacttg ccgttttcat 960
acttgaacgc ataacccccg tcagcagcaa tcagcggcca ggtgaagtac ggttcttgca 1020
ggttgaacat cagcgcgctc ttacctttcg ctttcagttc tttatccagc gccgggatct 1080
cttcccaggt ttttggcggg ttcggcagca gatctttgtt ataaatcagc gataacgctt 1140
caacagcgat cgggtaagca atcagcttgc cgttgtaacg tacggcatcc caggtaaacg 1200
gatacagctt gtcctggaac gctttgtccg gggtgatttc agccaacagg ccagattgag 1260
cgtagccacc aaagcggtcg tgtgcccaga agataatgtc agggccatcg ccagttgccg 1320
caacctgtgg gaatttctct tccagtttat ccggatgctc aacggtgact ttaattccgg 1380
tatctttctc gaatttctta ccgacttcag cgagaccgtt atagccttta tcgccgttaa 1440
tccagattac cagtttacct tcttcgattc ccatggtata tctccttctt aaagttaaac 1500
aaaattattt ctagagggga attgttatcc gctcacaatt cccctatagt gagtcgtatt 1560
aatttcgcgg gatcgagatc tcgatcctct acgccggacg catcgtggcc ggcatcaccg 1620
gcgccacagg tgcggttgct ggcgcctata tcgccgacat caccgatggg gaagatcggg 1680
ctcgccactt cgggctcatg agcgcttgtt tcggcgtggg tatggtggca ggccccgtgg 1740
ccgggggact gttgggcgcc atctccttgc atgcaccatt ccttgcggcg gcggtgctca 1800
acggcctcaa cctactactg ggctgcttcc taatgcagga gtcgcataag ggagagcgtc 1860
gagatcccgg acaccatcga atggcgcaaa acctttcgcg gtatggcatg atagcgcccg 1920
gaagagagtc aattcagggt ggtgaatgtg aaaccagtaa cgttatacga tgtcgcagag 1980
tatgccggtg tctcttatca gaccgtttcc cgcgtggtga accaggccag ccacgtttct 2040
gcgaaaacgc gggaaaaagt ggaagcggcg atggcggagc tgaattacat tcccaaccgc 2100
gtggcacaac aactggcggg caaacagtcg ttgctgattg gcgttgccac ctccagtctg 2160
gccctgcacg cgccgtcgca aattgtcgcg gcgattaaat ctcgcgccga tcaactgggt 2220
gccagcgtgg tggtgtcgat ggtagaacga agcggcgtcg aagcctgtaa agcggcggtg 2280
cacaatcttc tcgcgcaacg cgtcagtggg ctgatcatta actatccgct ggatgaccag 2340
gatgccattg ctgtggaagc tgcctgcact aatgttccgg cgttatttct tgatgtctct 2400
gaccagacac ccatcaacag tattattttc tcccatgaag acggtacgcg actgggcgtg 2460
gagcatctgg tcgcattggg tcaccagcaa atcgcgctgt tagcgggccc attaagttct 2520
gtctcggcgc gtctgcgtct ggctggctgg cataaatatc tcactcgcaa tcaaattcag 2580
ccgatagcgg aacgggaagg cgactggagt gccatgtccg gttttcaaca aaccatgcaa 2640
atgctgaatg agggcatcgt tcccactgcg atgctggttg ccaacgatca gatggcgctg 2700
ggcgcaatgc gcgccattac cgagtccggg ctgcgcgttg gtgcggatat ctcggtagtg 2760
ggatacgacg ataccgaaga cagctcatgt tatatcccgc cgttaaccac catcaaacag 2820
gattttcgcc tgctggggca aaccagcgtg gaccgcttgc tgcaactctc tcagggccag 2880
gcggtgaagg gcaatcagct gttgcccgtc tcactggtga aaagaaaaac caccctggcg 2940
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 3000
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtaa gttagctcac 3060
tcattaggca ccgggatctc gaccgatgcc cttgagagcc ttcaacccag tcagctcctt 3120
ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg actgtcttct ttatcatgca 3180
actcgtagga caggtgccgg cagcgctctg ggtcattttc ggcgaggacc gctttcgctg 3240
gagcgcgacg atgatcggcc tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca 3300
agccttcgtc actggtcccg ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg 3360
catggcggcc ccacgggtgc gcatgatcgt gctcctgtcg ttgaggaccc ggctaggctg 3420
gcggggttgc cttactggtt agcagaatga atcaccgata cgcgagcgaa cgtgaagcga 3480
ctgctgctgc aaaacgtctg cgacctgagc aacaacatga atggtcttcg gtttccgtgt 3540
ttcgtaaagt ctggaaacgc ggaagtcagc gccctgcacc attatgttcc ggatctgcat 3600
cgcaggatgc tgctggctac cctgtggaac acctacatct gtattaacga agcgctggca 3660
ttgaccctga gtgatttttc tctggtcccg ccgcatccat accgccagtt gtttaccctc 3720
acaacgttcc agtaaccggg catgttcatc atcagtaacc cgtatcgtga gcatcctctc 3780
tcgtttcatc ggtatcatta cccccatgaa cagaaatccc ccttacacgg aggcatcagt 3840
gaccaaacag gaaaaaaccg cccttaacat ggcccgcttt atcagaagcc agacattaac 3900
gcttctggag aaactcaacg agctggacgc ggatgaacag gcagacatct gtgaatcgct 3960
tcacgaccac gctgatgagc tttaccgcag ctgcctcgcg cgtttcggtg atgacggtga 4020
aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 4080
gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat 4140
gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag 4200
attgtactga gagtgcacca tatatgcggt gtgaaatacc gcacagatgc gtaaggagaa 4260
aataccgcat caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 4320
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 4380
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 4440
aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 4500
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 4560
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 4620
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 4680
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 4740
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 4800
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 4860
agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 4920
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 4980
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 5040
gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 5100
cacgttaagg gattttggtc atgaacaata aaactgtctg cttacataaa cagtaataca 5160
aggggtgtta tgagccatat tcaacgggaa acgtcttgct ctaggccgcg attaaattcc 5220
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 5280
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 5340
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 5400
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 5460
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 5520
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 5580
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 5640
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 5700
gtctggaaag aaatgcataa acttttgcca ttctcaccgg attcagtcgt cactcatggt 5760
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 5820
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 5880
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 5940
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagaatt aattcatgag 6000
cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc 6060
ccgaaaagtg ccacctgaaa ttgtaaacgt taatattttg ttaaaattcg cgttaaattt 6120
ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc 6180
aaaagaatag accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt 6240
aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact 6300
acgtgaacca tcaccctaat caagtttttt ggggtcgagg tgccgtaaag cactaaatcg 6360
gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga acgtggcgag 6420
aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg tagcggtcac 6480
gctgcgcgta accaccacac ccgccgcgct taatgcgccg ctacagggcg cgtcccattc 6540
gcca 6544



<210> 41 
<211> 426 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> designed protein 
<400> 41 

Met Gly Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys
1 5 10 15

Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr
20 25 30

Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe
35 40 45

Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala
50 55 60

His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile
65 70 75 80

Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp
85 90 95

Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu
100 105 110

Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys
115 120 125

Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly
130 135 140

Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro
145 150 155 160

Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys
165 170 175

Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly
180 185 190

Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp
195 200 205

Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala
210 215 220

Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys
225 230 235 240

Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser
245 250 255

Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro
260 265 270

Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp
275 280 285

Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala
290 295 300

Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala
305 310 315 320

Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln
325 330 335

Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala
340 345 350

Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn
355 360 365

Ser Ser Ser Gly Gly Ser Gly Ser Gly Met Asp Glu Lys Thr Thr Gly
370 375 380

Trp Arg Gly Gly His Val Val Glu Gly Leu Ala Gly Glu Leu Glu Gln
385 390 395 400

Leu Arg Ala Arg Leu Glu His His Pro Gln Gly Gln Arg Glu Pro Gly
405 410 415

Ser Gly His His His His His His Glu Phe
420 425